转移学习(TL)利用以前获得的知识有效地学习新任务,并且已被用于培训具有有限数量的数据的深度学习(DL)模型。当TL应用于DL时,佩带的预押(教师)模型是微调的,以构建特定域(学生)模型。这种微调依赖于DL模型可以分解到分类器和特征提取器,并且一系列研究表明,相同的特征提取器可用于培训多个任务上的分类器。此外,最近的研究提出了多种算法,可以进行微调教师模型的特征提取器,以更有效地培训学生模型。我们注意到,无论特征提取器的微调如何,学生模型的分类器都接受了特征提取器的最终输出(即倒数第二层的输出)。然而,最近的一项研究表明,跨层中的Resnet中的特征映射可能是在功能上等同的,提高要素提取器内的特征映射的可能性也可用于训练学生模型的分类器。灵感来自这项研究,我们测试了教师模型隐藏层中的特征映射,可用于提高学生模型的准确性(即,TL的效率)。具体而言,我们开发了“自适应传输学习(ATL)”,可以选择用于TL的最佳特征映射,并在几次拍摄的学习设置中测试。我们的实证评估表明,ATL可以帮助DL模型更有效地学习,特别是当可用示例有限时。
translated by 谷歌翻译
Edge computing-based 3D perception has received attention in intelligent transportation systems (ITS) because real-time monitoring of traffic candidates potentially strengthens Vehicle-to-Everything (V2X) orchestration. Thanks to the capability of precisely measuring the depth information on surroundings from LiDAR, the increasing studies focus on lidar-based 3D detection, which significantly promotes the development of 3D perception. Few methods met the real-time requirement of edge deployment because of high computation-intensive operations. Moreover, an inconsistency problem of object detection remains uncovered in the pointcloud domain due to large sparsity. This paper thoroughly analyses this problem, comprehensively roused by recent works on determining inconsistency problems in the image specialisation. Therefore, we proposed a 3D harmonic loss function to relieve the pointcloud based inconsistent predictions. Moreover, the feasibility of 3D harmonic loss is demonstrated from a mathematical optimization perspective. The KITTI dataset and DAIR-V2X-I dataset are used for simulations, and our proposed method considerably improves the performance than benchmark models. Further, the simulative deployment on an edge device (Jetson Xavier TX) validates our proposed model's efficiency. Our code is open-source and publicly available.
translated by 谷歌翻译
通常,大型数据集使深度学习模型能够以良好的准确性和可推广性能。但是,大规模的高保真仿真数据集(来自分子化学,天体物理学,计算流体动力学(CFD)等,由于维度和存储限制,策划的策划可能具有挑战性。损失的压缩算法可以帮助减轻存储的限制,只要很长时间保留了总体数据保真度。为了说明这一点,我们证明了对佩斯卡尔CFD模拟的数据进行了训练和测试的深度学习模型,对在语义细分问题中有损耗的压缩期间引入的错误是可靠的。我们的结果表明,有损压缩算法提供了一种现实的途径,可以将高保真科学数据暴露到开放源数据存储库中,以构建社区数据集。在本文中,我们概述,构建和评估建立大数据框架的要求,在https:// bastnet上证明。 github.io/,用于科学机器学习。
translated by 谷歌翻译
在新兴应用中,自主机器人对日常生活的潜在影响是明显的,如精密农业,搜救,救援和基础设施检查。然而,这种应用需要在不明和复杂的一组目标中具有广泛而非结构化的环境,所有这些应用都在严格的计算和功率限制下。因此,我们认为必须安排和优化支持机器人自主权的计算内核,以保证及时和正确的行为,同时允许在运行时重新配置调度参数。在本文中,我们考虑了一个必要的第一步,迈出了自主机器人的计算意识的目标:从资源管理角度来看,基础计算内核的实证研究。具体地,我们对三个嵌入式计算平台进行了用于定位和映射,路径规划,任务分配,深度估计和光流的核的定时,电源和内存性能的数据驱动的研究。我们配置文件并分析这些内核,为计算感知自治机器人提供了解调度和动态资源管理的洞察。值得注意的是,我们的结果表明,内核性能与机器人的运营环境有关,证明了计算感知机器人的概念以及为什么我们的作品对这一目标的关键步骤。
translated by 谷歌翻译
有限的角度X射线断层扫描重建是一个不良反问题一般。特别是当投影角度有限并且在光子限制条件下进行测量时,来自经典算法的重建,例如过滤的反光,可能导致由于缺失的问题而获取伪影。为了获得令人满意的重建结果,通常在重建算法中结合在重建算法中的令人满意的重建结果,例如总变化最小化和非局部图像相似度。在这项工作中,我们介绍了深度神经网络,以确定并应用重建过程的先前分配。我们的神经网络直接从合成训练样本中学习。因此,神经网络获得了对我们对重建感兴趣的对象类的特定的先前分配。特别是,我们使用了具有3D卷积层和3D注意图层的深生成的模型,这些层在来自DubBed电路库的3D合成集成电路(IC)数据上培训。我们证明,当投影角度和光子预算受到限制时,来自我们深度生成模型的前沿可以显着提高合成数据的IC重建质量,而与最大似然估计相比。使用电路库的合成IC数据训练深度生成模型说明了从机器学习之前学到的学习功能。我们预计,如果使用实验数据再现过程,机器学习的优势将持续存在。机器学习在有限角X射线断层扫描的优点可以进一步能够在低光子纳米级成像中实现应用。
translated by 谷歌翻译
According to the rapid development of drone technologies, drones are widely used in many applications including military domains. In this paper, a novel situation-aware DRL- based autonomous nonlinear drone mobility control algorithm in cyber-physical loitering munition applications. On the battlefield, the design of DRL-based autonomous control algorithm is not straightforward because real-world data gathering is generally not available. Therefore, the approach in this paper is that cyber-physical virtual environment is constructed with Unity environment. Based on the virtual cyber-physical battlefield scenarios, a DRL-based automated nonlinear drone mobility control algorithm can be designed, evaluated, and visualized. Moreover, many obstacles exist which is harmful for linear trajectory control in real-world battlefield scenarios. Thus, our proposed autonomous nonlinear drone mobility control algorithm utilizes situation-aware components those are implemented with a Raycast function in Unity virtual scenarios. Based on the gathered situation-aware information, the drone can autonomously and nonlinearly adjust its trajectory during flight. Therefore, this approach is obviously beneficial for avoiding obstacles in obstacle-deployed battlefields. Our visualization-based performance evaluation shows that the proposed algorithm is superior from the other linear mobility control algorithms.
translated by 谷歌翻译
Participants in political discourse employ rhetorical strategies -- such as hedging, attributions, or denials -- to display varying degrees of belief commitments to claims proposed by themselves or others. Traditionally, political scientists have studied these epistemic phenomena through labor-intensive manual content analysis. We propose to help automate such work through epistemic stance prediction, drawn from research in computational semantics, to distinguish at the clausal level what is asserted, denied, or only ambivalently suggested by the author or other mentioned entities (belief holders). We first develop a simple RoBERTa-based model for multi-source stance predictions that outperforms more complex state-of-the-art modeling. Then we demonstrate its novel application to political science by conducting a large-scale analysis of the Mass Market Manifestos corpus of U.S. political opinion books, where we characterize trends in cited belief holders -- respected allies and opposed bogeymen -- across U.S. political ideologies.
translated by 谷歌翻译
Steering language generation towards objectives or away from undesired content has been a long-standing goal in utilizing language models (LM). Recent work has demonstrated reinforcement learning and weighted decoding as effective approaches to achieve a higher level of language control and quality with pros and cons. In this work, we propose a novel critic decoding method for controlled language generation (CriticControl) that combines the strengths of reinforcement learning and weighted decoding. Specifically, we adopt the actor-critic framework to train an LM-steering critic from non-differentiable reward models. And similar to weighted decoding, our method freezes the language model and manipulates the output token distribution using called critic, improving training efficiency and stability. Evaluation of our method on three controlled generation tasks, namely topic control, sentiment control, and detoxification, shows that our approach generates more coherent and well-controlled texts than previous methods. In addition, CriticControl demonstrates superior generalization ability in zero-shot settings. Human evaluation studies also corroborate our findings.
translated by 谷歌翻译
Task-oriented dialogue (TOD) systems are mainly based on the slot-filling-based TOD (SF-TOD) framework, in which dialogues are broken down into smaller, controllable units (i.e., slots) to fulfill a specific task. A series of approaches based on this framework achieved remarkable success on various TOD benchmarks. However, we argue that the current TOD benchmarks are limited to surrogate real-world scenarios and that the current TOD models are still a long way from unraveling the scenarios. In this position paper, we first identify current status and limitations of SF-TOD systems. After that, we explore the WebTOD framework, the alternative direction for building a scalable TOD system when a web/mobile interface is available. In WebTOD, the dialogue system learns how to understand the web/mobile interface that the human agent interacts with, powered by a large-scale language model.
translated by 谷歌翻译
Recent studies have proposed a unified user modeling framework that leverages user behavior data from various applications. Most benefit from utilizing users' behavior sequences as plain texts, representing rich information in any domain or system without losing generality. Hence, a question arises: Can language modeling for user history corpus help improve recommender systems? While its versatile usability has been widely investigated in many domains, its applications to recommender systems still remain underexplored. We show that language modeling applied directly to task-specific user histories achieves excellent results on diverse recommendation tasks. Also, leveraging additional task-agnostic user histories delivers significant performance benefits. We further demonstrate that our approach can provide promising transfer learning capabilities for a broad spectrum of real-world recommender systems, even on unseen domains and services.
translated by 谷歌翻译